AI-Based Syntactic Pattern Recognition of Sequences

نویسندگان

  • Ghada Badr
  • John Oommen
چکیده

This patent concerns the traditional problem encountered in the syntactic Pattern Recognition (PR) of strings or sequences. The primary investigator involved in this work is a Full Professor at Carleton University in Ottawa, Canada, and is a Fellow of the IEEE. The primary problem solved by the invention involves determining the string or sequence that is most similar to a sequence presented to the system. The search could be initiated by presenting, to the system, a noisy or inexact version of a string contained in memory for example, at a web-site or in the library or database. The invention will yield the closest string/sequence by searching the dictionary of possible words using a newly invented AIbased strategy. The core of this invention is this search strategy, called the Clustered Beam Search. Experiments have been done to show the benefits of the CBS over the current state-ofthe-art, and the results demonstrate an unbelievably marked improvement (sometimes as high as 90%) for large libraries and databases. The solution provided by the invention would be applicable in numerous areas including : Inexact or proximity searching on the Internet, keyword-based search in libraries and databases, spelling correction, speech and character recognition (including optical character recognition), and the processing of biological sequences, for example, in human genome projects. These applications are briefly described below. ∗This author can be addressed at: School of Computer Science, Carleton University, Ottawa, Canada : K1S 5B6. e-mail address : [email protected]. †Professor and Fellow of the IEEE. This author can be contacted at: School of Computer Science, Carleton University, Ottawa, Canada : K1S 5B6. e-mail address : [email protected]. More information about this inventor, who holds a Doctorate from Purdue University, can be found at www.scs.carleton.ca/∼oommen.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

AI and Pattern Recognition

This panel deals with the relationships between artificial intelligence and pattern recognition. It addresses such issues as: How do the fields differ? How should they relate to one another? In particular, does AI have anything to learn from PR? Pattern recognition, classically, involves the extraction of features and their subsequent analysis in feature space using statistical tools or cluster...

متن کامل

Steel Consumption Forecasting Using Nonlinear Pattern Recognition Model Based on Self-Organizing Maps

Steel consumption is a critical factor affecting pricing decisions and a key element to achieve sustainable industrial development. Forecasting future trends of steel consumption based on analysis of nonlinear patterns using artificial intelligence (AI) techniques is the main purpose of this paper. Because there are several features affecting target variable which make the analysis of relations...

متن کامل

Syntactic Pattern Recognition Systemfor Dna

We review both theoretical and practical results of a linguistic approach to studying the structure of features of DNA sequences. Using generative grammars, complex assemblages can not only be described and analyzed abstractly, but also concretely, such that features can be searched for by a general-purpose parser. Our parser, called GenLang, uses an extended logic grammar formalism and has fou...

متن کامل

Integrating AI with Sequence Analysis

This chapter will discuss one example of how AI techniques are being integrated with, and extending, existing molecular biology sequence analysis methods. AI ideas of complex representations, pattern recognition, search, and machine learning have been applied to the task of inferring and recognizing structural patterns associated with molecular function. We wish to construct such patterns, and ...

متن کامل

A Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features

Named Entity Recognition is an information extraction technique that identifies name entities in a text. Three popular methods have been conventionally used namely: rule-based, machine-learning-based and hybrid of them to extract named entities from a text. Machine-learning-based methods have good performance in the Persian language if they are trained with good features. To get good performanc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005